Input Set : A:\Sequence Listing.txt
Output Set: N:\CRF3\03262002\J092880.raw 

<110> APPLICANT: Barenkamp, Stephen J. 

<120> TITLE OF INVENTION: HIGH MOLECULAR WEIGHT SURFACE PROTEINS OF NON-TYPEABLE HAEMOPHILUS
<130> FILE REFERENCE: 

<140> CURRENT APPLICATION NUMBER: US/10/092,880
<141> CURRENT FILING DATE: 2002-03-08 

<150> PRIOR APPLICATION NUMBER: 09/155,614 

<151> PRIOR FILING DATE: 1998-09-30 

<150> PRIOR APPLICATION NUMBER: 08/617,697 

<151> PRIOR FILING DATE: 1996-04-01 

<150> PRIOR APPLICATION NUMBER: PCT/US97/04707 

<151> PRIOR FILING DATE: 1997-04-01 

<160> NUMBER OF SEQ ID NOS : 11 

<170> SOFTWARE: PatentIn Ver. 2.1

<210> SEQ ID NO: 1 

<211> LENGTH: 5116 

<212> TYPE: DNA 

<213> ORGANISM: Haemophilus influenzae 
<400> SEQUENCE: 1 

acagcgttct cttaatacta gtacaaaccc acaataaaat atgacaaaca acaattacaa 60 
cacctttttt gcagtctata tgcaaatatt ttaaaaaata gtataaatcc gccatataaa 120 
atggtataat ctttcatctt tcatctttca tctttcatct ttcatctttc atctttcatc 180 
tttcatcttt catctttcat ctttcatctt tcatctttca tctttcatct ttcatctttc 240 
acatgccctg atgaaccgag ggaagggagg gaggggcaag aatgaagagg gagctgaacg 300 
aacgcaaatg ataaagtaat ttaattgttc aactaacctt aggagaaaat atgaacaagc 360 
tatatcgtct caaattcagc aaacgcctga atgctttggt tgctgtgtct gaattggcac 420 
ggggttgtga ccattccaca gaaaaaggca gcgaaaaacc tgctcgcatg aaagtgcgtc 480 
acttagcgtt aaagccactt tccgctatgt tactatcttt aggtgtaaca tctattccac 540 
aatctgtttt agcaagcggc ttacaaggaa tggatgtagt acacggcaca gccactatgc 600 
aagtagatgg taataaaacc attatccgca acagtgttga cgatatcatt aattggaaac 660 
aatttaacat cgaccaaaat gaaatggtgc agtttttaca agaaaacaac aactccgccg 720 
tattcaaccg tgttacatct aaccaaatct cccaattaaa agggatttta gattctaacg 780 
gacaagtctt tttaatcaac ccaaatggta tcacaatagg taaagacgca attattaaca 840 
ctaatggctt tacggcttct acgctagaca tttctaacga aaacatcaag gcgcgtaatt 900 
tcaccttcga gcaaaccaaa gataaagcgc tcgctgaaat tgtgaatcac ggtttaatta 960 
ctgtcggtaa agacggcagt gtaaatctta ttggtggcaa agtgaaaaac gagggtgtga 1020 
ttagcgtaaa tggtggcagc atttctttac tcgcagggca aaaaatcacc atcagcgata 1080 
taataaaccc aaccattact tacagcattg ccgcgcctga aaatgaagcg gtcaatctgg 1140 
gcgatatttt tgccaaaggc ggtaacatta atgtccgtgc tgccactatt cgaaaccaag 1200 
gtaaactttc tgctgattct gtaagcaaag ataaaagcgg caatattgtt ctttccgcca 1260 
aagagggtga agcggaaatt ggcggtgtaa tttccgctca aaatcagcaa gctaaaggcg 1320
gcaagctgat gattacaggc gataaagtca cattaaaaac aggtgcagtt atcgaccttt 1380 
caggtaaaga agggggagaa acttaccttg gcggtgacga gcgcggcgaa ggtaaaaagg 1440 
gcattcaatt agcaaagaaa acctctttag aaaaaggctc aaccatcaat gtatcaggca 1500
aagaaaaagg cggacgcgct attgtgtggg gcgatattgc gttaattgac ggcaatatta 1560
acgctcaagg tagtggtgat atcgctaaaa ccggtggttt tgtggagacg tcggggcatg 1620
atttattcat caaagacaat gcaattgttg acgccaaaga gtggttgtta gacccggata 1680
atgtatctat taatgcagaa acagcaggac gcagcaatac ttcagaagac gatgaataca 1740
cgggatccgg gaatagtgcc agcaccccaa aacgaaacaa agaaaagaca acattaacaa 1800
acacaactct tgagagtata ctaaaaaaag gtacctttgt taacatcact gctaatcaac 1860
gcatctatgt caatagctcc attaatttat ccaatggcag cttaactctt tggagtgagg 1920
gtcggagcgg tggcggcgtt gagattaaca acgatattac caccggtgat gataccagag 1980
gtgcaaactt aacaatttac tcaggcggct gggttgatgt tcataaaaat atctcactcg 2040
gggcgcaagg taacataaac attacagcta aacaagatat cgcctttgag aaaggaagca 2100
accaagtcat tacaggtcaa gggactatta cctcaggcaa tcaaaaaggt tttagattta 2160
ataatgtctc tctaaacggc actggcagcg gactgcaatt caccactaaa agaaccaata 2220
aatacgctat cacaaataaa tttgaaggga ctttaaatat ttcagggaaa gtgaacatct 2280
caatggtttt acctaaaaat gaaagtggat atgataaatt caaaggacgc acttactgga 2340
atttaacctc cttaaatgtt tccgagagtg gcgagtttaa cctcactatt gactccagag 2400
gaagcgatag tgcaggcaca cttacccagc cttataattt aaacggtata tcattcaaca 2460
aagacactac ctttaatgtt gaacgaaatg caagagtcaa ctttgacatc aaggcaccaa 2520
tagggataaa taagtattct agtttgaatt acgcatcatt taatggaaac atttcagttt 2580
cgggaggggg gagtgttgat ttcacacttc tcgcctcatc ctctaacgtc caaacccccg 2640
gtgtagttat aaattctaaa tactttaatg tttcaacagg gtcaagttta agatttaaaa 2700
cttcaggctc aacaaaaact ggcttctcaa tagagaaaga tttaacttta aatgccaccg 2760
gaggcaacat aacacttttg caagttgaag gcaccgatgg aatgattggt aaaggcattg 2820
tagccaaaaa aaacataacc tttgaaggag gtaacatcac ctttggctcc aggaaagccg 2880
taacagaaat cgaaggcaat gttactatca ataacaacgc taacgtcact cttatcggtt 2940
cggattttga caaccatcaa aaacctttaa ctattaaaaa agatgtcatc attaatagcg 3000
gcaaccttac cgctggaggc aatattgtca atatagccgg aaatcttacc gttgaaagta 3060
acgctaattt caaagctatc acaaatttca cttttaatgt aggcggcttg tttgacaaca 3120
aaggcaattc aaatatttcc attgccaaag gaggggctcg ctttaaagac attgataatt 3180
ccaagaattt aagcatcacc accaactcca gctccactta ccgcactatt ataagcggca 3240
atataaccaa taaaaacggt gatttaaata ttacgaacga aggtagtgat actgaaatgc 3300
aaattggcgg cgatgtctcg caaaaagaag gtaatctcac gatttcttct gacaaaatca 3360
atattaccaa acagataaca atcaaggcag gtgttgatgg ggagaattcc gattcagacg 3420
cgacaaacaa tgccaatcta accattaaaa ccaaagaatt gaaattaacg caagacctaa 3480
atatttcagg tttcaataaa gcagagatta cagctaaaga tggtagtgat ttaactattg 3540
gtaacaccaa tagtgctgat ggtactaatg ccaaaaaagt aacctttaac caggttaaag 3600
attcaaaaat ctctgctgac ggtcacaagg tgacactaca cagcaaagtg gaaacatccg 3660
gtagtaataa caacactgaa gatagcagtg acaataatgc cggcttaact atcgatgcaa 3720
aaaatgtaac agtaaacaac aatattactt ctcacaaagc agtgagcatc tctgcgacaa 3780
gtggagaaat taccactaaa acaggtacaa ccattaacgc aaccactggt aacgtggaga 3840
taaccgctca aacaggtagt atcctaggtg gaattgagtc cagctctggc tctgtaacac 3900
ttactgcaac cgagggcgct cttgctgtaa gcaatatttc gggcaacacc gttactgtta 3960
ctgcaaatag cggtgcatta accactttgg caggctctac aattaaagga accgagagtg 4020
taaccacttc aagtcaatca ggcgatatcg gcggtacgat ttctggtggc acagtagagg 4080
ttaaagcaac cgaaagttta accactcaat ccaattcaaa aattaaagca acaacaggcg 4140
aggctaacgt aacaagtgca acaggtacaa ttggtggtac gatttccggt aatacggtaa 4200
atgttacggc aaacgctggc gatttaacag ttgggaatgg cgcagaaatt aatgcgacag 4260
aaggagctgc aaccttaact acatcatcgg gcaaattaac taccgaagct agttcacaca 4320
ttacttcagc caagggtcag gtaaatcttt cagctcagga tggtagcgtt gcaggaagta 4380
ttaatgccgc caatgtgaca ctaaatacta caggcacttt aactaccgtg aagggttcaa 4440
acattaatgc aaccagcggt accttggtta ttaacgcaaa agacgctgag ctaaatggcg 4500
cagcattggg taaccacaca gtggtaaatg caaccaacgc aaatggctcc ggcagcgtaa 4560
tcgcgacaac ctcaagcaga gtgaacatca ctggggattt aatcacaata aatggattaa 4620
atatcatttc aaaaaacggt ataaacaccg tactgttaaa aggcgttaaa attgatgtga 4680
aatacattca accgggtata gcaagcgtag atgaagtaat tgaagcgaaa cgcatccttg 4740
agaaggtaaa agatttatct gatgaagaaa gagaagcgtt agctaaactt ggagtaagtg 4800
ctgtacgttt tattgagcca aataatacaa ttacagtcga tacacaaaat gaatttgcaa 4860
ccagaccatt aagtcgaata gtgatttctg aaggcagggc gtgtttctca aacagtgatg 4920
gcgcgacggt gtgcgttaat atcgctgata acgggcggta gcggtcagta attgacaagg 4980
tagatttcat cctgcaatga agtcatttta ttttcgtatt atttactgtg tgggttaaag 5040
ttcagtacgg gctttaccca tcttgtaaaa aattacggag aatacaataa agtattttta 5100
acaggttatt attatg 5116

<210> SEQ ID NO: 2

<211> LENGTH: 1536

<212> TYPE: PRT

<213> ORGANISM: Haemophilus influenzae
<400> SEQUENCE: 2

Met Asn Lys Ile Tyr Arg Leu Lys Phe Ser Lys Arg Leu Asn Ala Leu
1 5 10 15

Val Ala Val Ser Glu Leu Ala Arg Gly Cys Asp His Ser Thr Glu Lys
20 25 30

Gly Ser Glu Lys Pro Ala Arg Met Lys Val Arg His Leu Ala Leu Lys
35 40 45

Pro Leu Ser Ala Met Leu Leu Ser Leu Gly Val Thr Ser Ile Pro Gln
50 55 60

Ser Val Leu Ala Ser Gly Leu Gln Gly Met Asp Val Val His Gly Thr
65 70 75 80

Ala Thr Met Gln Val Asp Gly Asn Lys Thr Ile Ile Arg Asn Ser Val
85 90 95

Asp Ala Ile Ile Asn Trp Lys Gln Phe Asn Ile Asp Gln Asn Glu Met
100 105 110

Val Gln Phe Leu Gln Glu Asn Asn Asn Ser Ala Val Phe Asn Arg Val
115 120 125

Thr Ser Asn Gln Ile Ser Gln Leu Lys Gly Ile Leu Asp Ser Asn Gly
130 135 140

Gln Val Phe Leu Ile Asn Pro Asn Gly Ile Thr Ile Gly Lys Asp Ala
145 150 155 160

Ile Ile Asn Thr Asn Gly Phe Thr Ala Ser Thr Leu Asp Ile Ser Asn
165 170 175

Glu Asn Ile Lys Ala Arg Asn Phe Thr Phe Glu Gln Thr Lys Asp Lys
180 185 190

Ala Leu Ala Glu Ile Val Asn His Gly Leu Ile Thr Val Gly Lys Asp
195 200 205

Gly Ser Val Asn Leu Ile Gly Gly Lys Val Lys Asn Glu Gly Val Ile
210 215 220

Ser Val Asn Gly Gly Ser Ile Ser Leu Leu Ala Gly Gln Lys Ile Thr
225 230 235 240

Ile Ser Asp Ile Ile Asn Pro Thr Ile Thr Tyr Ser Ile Ala Ala Pro
245 250 255

Glu Asn Glu Ala Val Asn Leu Gly Asp Ile Phe Ala Lys Gly Gly Asn
260 265 270

Ile Asn Val Arg Ala Ala Thr Ile Arg Asn Gln Gly Lys Leu Ser Ala
275 280 285

Asp Ser Val Ser Lys Asp Lys Ser Gly Asn Ile Val Leu Ser Ala Lys
290 295 300

Glu Gly Glu Ala Glu Ile Gly Gly Val Ile Ser Ala Gln Asn Gln Gln
305 310 315 320

Ala Lys Gly Gly Lys Leu Met Ile Thr Gly Asp Lys Val Thr Leu Lys
325 330 335

Thr Gly Ala Val Ile Asp Leu Ser Gly Lys Glu Gly Gly Glu Thr Tyr
340 345 350

Leu Gly Gly Asp Glu Arg Gly Glu Gly Lys Asn Gly Ile Gln Leu Ala
355 360 365

Lys Lys Thr Ser Leu Glu Lys Gly Ser Thr Ile Asn Val Ser Gly Lys
370 375 380

Glu Lys Gly Gly Arg Ala Ile Val Trp Gly Asp Ile Ala Leu Ile Asp
385 390 395 400

Gly Asn Ile Asn Ala Gln Gly Ser Gly Asp Ile Ala Lys Thr Gly Gly
405 410 415

Phe Val Glu Thr Ser Gly His Asp Leu Phe Ile Lys Asp Asn Ala Ile
420 425 430

Val Asp Ala Lys Glu Trp Leu Leu Asp Phe Asp Asn Val Ser Ile Asn
435 440 445

Ala Glu Thr Ala Gly Arg Ser Asn Thr Ser Glu Asp Asp Glu Tyr Thr
450 455 460

Gly Ser Gly Asn Ser Ala Ser Thr Pro Lys Arg Asn Lys Glu Lys Thr
465 470 475 480

Thr Leu Thr Asn Thr Thr Leu Glu Ser Ile Leu Lys Lys Gly Thr Phe
485 490 495

Val Asn Ile Thr Ala Asn Gln Arg Ile Tyr Val Asn Ser Ser Ile Asn
500 505 510

Leu Ser Asn Gly Ser Leu Thr Leu Trp Ser Glu Gly Arg Ser Gly Gly
515 520 525

Gly Val Glu Ile Asn Asn Asp Ile Thr Thr Gly Asp Asp Thr Arg Gly
530 535 540

Ala Asn Leu Thr Ile Tyr Ser Gly Gly Trp Val Asp Val His Lys Asn
545 550 555 560

Ile Ser Leu Gly Ala Gln Gly Asn Ile Asn Ile Thr Ala Lys Gln Asp
565 570 575

Ile Ala Phe Glu Lys Gly Ser Asn Gln Val Ile Thr Gly Gln Gly Thr
580 585 590

Ile Thr Ser Gly Asn Gln Lys Gly Phe Arg Phe Asn Asn Val Ser Leu
595 600 605

Asn Gly Thr Gly Ser Gly Leu Gln Phe Thr Thr Lys Arg Thr Asn Lys
610 615 620

Tyr Ala Ile Thr Asn Lys Phe Glu Gly Thr Leu Asn Ile Ser Gly Lys
625 630 635 640

Val Asn Ile Ser Met Val Leu Pro Lys Asn Glu Ser Gly Tyr Asp Lys
645 650 655

Phe Lys Gly Arg Thr Tyr Trp Asn Leu Thr Ser Leu Asn Val Ser Glu
660 665 670

Ser Gly Glu Phe Asn Leu Thr Ile Asp Ser Arg Gly Ser Asp Ser Ala
675 680 685

Gly Thr Leu Thr Gln Pro Tyr Asn Leu Asn Gly Ile Ser Phe Asn Lys
690 695 700

Asp Thr Thr Phe Asn Val Glu Arg Asn Ala Arg Val Asn Phe Asp Ile
705 710 715 720

Lys Ala Pro Ile Gly Ile Asn Lys Tyr Ser Ser Leu Asn Tyr Ala Ser
725 730 735

Phe Asn Gly Asn Ile Ser Val Ser Gly Gly Gly Ser Val Asp Phe Thr
740 745 750

Leu Leu Ala Ser Ser Ser Asn Val Gln Thr Pro Gly Val Val Ile Asn
755 760 765

Ser Lys Tyr Phe Asn Val Ser Thr Gly Ser Ser Leu Arg Phe Lys Thr
770 775 780

Ser Gly Ser Thr Lys Thr Gly Phe Ser Ile Glu Lys Asp Leu Thr Leu
785 790 795 800

Asn Ala Thr Gly Gly Asn Ile Thr Leu Leu Gln Val Glu Gly Thr Asp
805 810 815

Gly Met Ile Gly Lys Gly Ile Val Ala Lys Lys Asn Ile Thr Phe Glu
820 825 830

Gly Gly Asn Ile Thr Phe Gly Ser Arg Lys Ala Val Thr Glu Ile Glu
835 840 845

Gly Asn Val Thr Ile Asn Asn Asn Ala Asn Val Thr Leu Ile Gly Ser
850 855 860

Asp Phe Asp Asn His Gln Lys Pro Leu Thr Ile Lys Lys Asp Val Ile
865 870 875 880

Ile Asn Ser Gly Asn Leu Thr Ala Gly Gly Asn Ile Val Asn Ile Ala
885 890 895

Gly Asn Leu Thr Val Glu Ser Asn Ala Asn Phe Lys Ala Ile Thr Asn
900 905 910

Phe Thr Phe Asn Val Gly Gly Leu Phe Asp Asn Lys Gly Asn Ser Asn
915 920 925

Ile Ser Ile Ala Lys Gly Gly Ala Arg Phe Lys Asp Ile Asp Asn Ser
930 935 940

Lys Asn Leu Ser Ile Thr Thr Asn Ser Ser Ser Thr Tyr Arg Thr Ile
945 950 955 960

Ile Ser Gly Asn Ile Thr Asn Lys Asn Gly Asp Leu Asn Ile Thr Asn
965 970 975

Glu Gly Ser Asp Thr Glu Met Gln Ile Gly Gly Asp Val Ser Gln Lys
980 985 990

Glu Gly Asn Leu Thr Ile Ser Ser Asp Lys Ile Asn Ile Thr Lys Gln
995 1000 1005

Ile Thr Ile Lys Ala Gly Val Asp Gly Glu Asn Ser Asp Ser Asp Ala
1010 1015 1020

Thr Asn Asn Ala Asn Leu Thr Ile Lys Thr Lys Glu Leu Lys Leu Thr
PATENT APPLICATION: US/10/092,880



DATE: 03/26/2002 
TIME: 13:16:52 




L:8 M:201 W: Mandatory field data missing, FILE REFERENCE 

L:10 M:270 C: Current Application Number differs, Replaced Application Number 
L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date